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O ■ Abstract 

^ ■ The computer reahzation of a "sense of humour" requires the creation of 

ff^ . an algorithm for solving the "linguistic problem", i.e. the problem of rec- 

'""' ! ognizing a continuous sequence of polysemantic images. Such an algorithm 

I— 1 1 can be realized in the Hopfield model of a neural network, if it is suitably 

w ■ modified. 

d : 

• ^ . In [1] we analysed the general algorithm of information processing and showed that on 

I ' fulfillment of the natural requirements imposed by its biological purpose such an algorithm 

i__i| will possess a "sense of humour". The present paper proposes a possible realization of the 

' algorithm in a system of formal neurons. 

>'■ 

^ '. 

O ■ Description of the model 

(N; 

T-H I Following Hopfield [2] we shall consider that the state of the i-th neuron is described 

Pr ■ by the variable Vi assuming two values: Vi = 1 (excited state) and V^ = (state of rest). 

r* ■ The link of the neuron i with the neuron j is determined by the parameter Tjj. The state 

• 1— I ! of the system changes with time t according to the algorithm: 

><; 

■ - ' V,{t + St) = l + ^sign I Y: T^Mt) -u\, (1) 

where Ui is the excitation threshold of the i-th neuron and the number i is chosen randomly. 
The proposed model of the nervous system is a modification of the trilayer perceptrone 
[3] adapted for work in real time. It contains the following elements (Fig. 1). 
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The associative memory (A -layer) represents the neural network which for simphcity 
we consider as described by the Hopfield modeh the neurons of the A -layer are linked 
with each other with Tij = Tji, Ui = 0. Within the A- layer evolution according to (1) 
leads from the arbitrary initial state {Vi} to one of the local minima of energy 

E = -J2T^JV,V,, (2) 

which are identified with the images written in the memory {V^j?}, s = 1,2, ... ,p. The 
recorded images determine the matrix of the links Tij [2]: 

T,^ = J2f^s{2Vf-l){2Vf-l), 1^3 (/i. >0), T,, = (3) 

s=l 

The sensory system (S -layer) receives signals from the outside world (for example, 
from the retina of the eye). The sensory neurons are not linked between themselves, but 
each S*- neuron is bound to one of the memory neurons: the links S ^ A are positive 
(exciting) and the back links A ^ S are negative (inhibiting) (Fig. 1). 

The reacting system (R -layer) consists of a set of neurons each of which corresponds 
to one of the images recorded in the memory: to the s-th neuron of the i?- layer converge 
the positive (exciting) links from those neurons i of the memory for which V^ = 1 (we shall 
refer the last neurons as a carrier of the image {^/}). The back negative (inhibiting) links 
run from the i?- neurons to the memory neurons; the i?- neurons are not linked between 
themselves (Fig. 1). The thresholds Ui for the i?- neurons are adjusted in such a manner 
that the excitation of the s-th neuron of the i?- layer occurs only when the configuration 
of the A- neurons is sufficiently close to the image {l^/}. 

We consider that the image is realized by the biological individual only when there is 
excitation of the corresponding i?-neuron, i.e. the i?- layer represents the consciousness 
of the individual. 

The centre coordinates the work of the system by acting according to the built-in pro- 
gram: it exercises control over the macroscopic parameters of the system. The concrete 
functions of the centre consist in the following. 

(1) The centre has links with a small part of the memory neurons evenly distributed 
in the A -layer which allows it to register (a) the presence of excited neurons in a certain 
portion of the memory and (b) the stationarity or nonstationarity of this portion. 

(2) The centre produces local changes of "temperature" in the A -layer. Since the 
temperature of the neural network is determined by the noise level in it (which can be 
taken into account by introducing the random force fi{t) into the braces (1)) then the 
regulatable noise source should be at the disposal of the centre. 

(3) The centre produces local switching on the "magnetic field" in the A -layer which 
corresponds to the addition in energy (2) of the term 

E h^V, (4) 
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Figure 1: Proposed model of the nervous system is a modification of the three -layer 
perceptrone [3]: filled circles are neurons; solid lines are constantly acting links between 
neurons; broken lines are links switched on the command from the centre; the symbols 
+ and — indicate the excitatory and inhibitory character of the links; the S*- layer is the 
sensory system; the A -layer is the associative memory; the /?- layer is the reacting system 
or "consciousness". Roman numerals mark: (/) information from the A -layer, (//) the 
control of the links, (///) the outer world, {IV) the motor cortex. Inset in the right upper 
corner shows possible realization of the controllable link. 



(in (1) hi are added to the thresholds Uj). Switching on the field is achieved with the aid 
of a "magnet" — group of neurons controlled from the centre, each of which is connected 
with a certain region of the A -layer. 

(4) The centre carries out the control of the links shown in Fig. 1 by a broken line. The 
simplest realization of the controllable link AB is possible with the aid of the intermediate 
neuron C (see Fig. 1), the threshold of which is so adjusted that it is excited only in the 
simultaneous presence of the exciting signal from the neuron A and from the centre. In 
the presence of a signal from the centre the neuron A excites or inhibits the neuron B — 
the link is switched on, in the absence of a signal from the centre the neuron A cannot act 
upon the neuron B — the link is switched off. The command for switching on and off is 
given not to the individual links but simultaneously to their large groups. 

(5) The centre gives the command for the learning of the plastic links. 



Recognition of a separate image 

Recognition of the images occurs with the links A ^ S and R -^ A switched on. In the 
initial state of the system all the neurons are not excited. Since the state of the A - layer 
with Vi = is unstable (see (1) at f/j = 0), so the presence of the stabilizing magnetic field 
is necessary. 

Let the stimulus B = B + 6B (i.e. the "noisy" image B) is presented to the sensory 
system; this induces excitation of some of the S'-neurons (Fig. 2, a). Then the centre 
switches out the magnetic field and excitation is transmitted to the A -neurons (Fig. 2,b) 
which, in turn, quench the sensory neurons (Fig. 2,c) (it is assumed that the links S —^ A 
and A ^ S are suffiiciently strong). Then in the A -layer there is free evolution according 
to (1) which ends in relaxation to the stable state corresponding to the image B (Fig. 2,d); 
the neuron responsible for this image is excited in the i?- layer (Fig. 2,e). The back signal is 
send into the memory, quenching the excited neurons (Fig. 2,f) and the image B is deleted 
from the "consciousness" (Fig. 2,g): the system returns to the initial state and is ready for 
the perception of a new image. 



Learning 

The links between the A - neurons are plastic and change according to the rule [2] 

6T,,^{2V,-l){2V,-l)6t (t^j), (5) 

if the neurons i and j stay in the states Vi and Vj during the time 6t. If in the initial 
state Tij = then the presentation to the system of p configurations {Vf}, s = 1,2, ...,p 
leads to the matrix of links (3). Since T^j may have any sign, the neurons of the A -layer 
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Figure 2: Sequential states of the system in the course of recognition of a separate image; 
open circles (J) are excited neurons, dark circles {II) — not excited ones. For clarity, only 
the links are shown along which excitation was transmitted at the preceding moment of 
time. 
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Figure 3: Learning occurs with the S ^ A and R ^ A hnks switched off. 

must have both exciting and inhibiting synapses (speciahzation of the synapses is known 
[4, pp. 62-65] to have invariant character). 

The hnks A ^ R in the initial state have a zero value and can be learned only in one, 
positive direction 
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i.e. have only exciting synapses. The remaining links {R -^ A, S ^ A, A ^ S) are not 
plastic and have inborn character. 

The learning of the system is similar to the learning of a child: a certain image B is 
presented to it generating in the S*- layer a certain configuration of excited neurons; then 
it is asked to "memorize B" , which excites one of the neurons of the i?- layer appointed 
to be responsible for the image B. Learning occurs with the A ^ S and R ^ A links cut 
out (Fig. 3), so that the configuration of the S*- neurons is projected into the A -layer and 
persists for a certain time: the links Tij in the A -layer change according to (5) forming 
the matrix (3), while the links A ^ R change according to (6) ensuring the connection of 
the s-th neuron of the i?- layer with a carrier of the image {V^/}. 

In the resting state Vi = the system spends a considerable time and the learning 
according to (5) should result in "ferromagnetic" interaction between the neurons {Tij > 
for all i,j) and the single state Vi = 1 being stable. Therefore, we suppose that there is 



no learning in the state Vi = 0, and the learning command is given only on presentation of 
the image. 



Short-range action of the links and localization of images 

In the usual Hopfield model [2] all the neurons are linked with each other and the 
carriers of images {V^'^} are spread out over the whole neural net. In reality the links Tij 
have a finite radius of action ^■. in the human brain each neuron has ~ 10^ synapses, while a 
complete number of neurons ~ 10^^ [4, pp. 31-33]. Experimental indications on localization 
of the images also exist [3, pp. 64, 65]. 

We shall consider that each image {V^*} is written in a certain region Qg containing many 
neurons, but small as compared with the size of the whole neural network; so that Vf = 
for i not belonging to Qg (the carrier {Vf} is localized in fig) and for i & fig the magnitudes 
Vf with equal probability assume the values and I.g Since recognition of the images in 
any event must be preceded by translational shift, rotation and change of the scale (which 
can be achieved by a certain modification of the Hopfield model [5]) then the assumption 
on the localization of images does not have any serious consequences. The command for 
switching off the magnetic field, change in temperature (see below) and learning of plastic 
links is given only on presentation of the image {Vf} and only for neurons of the region 

Taking all this into account, we accept that the learning rule for the A -neurons instead 
of (5) has the form 

STi, ~ A, StS]{2V, - 1) {2V, - 1) 5t (z ^ j) (7) 

when the image {Vf} is presented; here 

^ ^ f 1, for rij<^, i^j ^^ = / ^' ^°^ i G fis /gN 

^-^ 1 0, in other cases ' * 1 0, for i ^ Vlg 

and Tij is the distance between the neurons i and j. The matrix of the links after writing 
p images instead of (3) assumes the form 

T,, = A, E St 6]fig{2V; - 1)(21A^ - 1), /i, > 0. (9) 

s=l 

To demonstrate stability of {Vf} let us compose the combination [2] 

H! = l:T^JVf = Y.f^s'Sfi2Vf - 1) E D^^Vf{2Vf - 1), (10) 



^ To raise the stability of the system relative to the destruction of some of the neurons the region fig 
can be multiply connected. 



where Qss' is the intersection of Qg and Qg/. By virtue of the randomness of Vf and the 
large number of terms in the sum for j the latter is close to its mean: 

^r = ^M^E A,(2K/-i) (11) 

For i & Qg the sum over j is positive and 6^ = 1 so that the configuration of {V^*} is stable 
by virtue of the algorithm (1); for i ^ Qg the state V^ = is maintained by the magnetic 
field. 

Because of the localization of the images it suffices for the s-th neuron of the R - layer 
to have links only with the neurons of the region Qg of the A - layer. 



Recognition of the ambiguous image 

Above it was assumed that the stimulus presented, B = B + 6B, is close to image B 
contained in the memory so that the initial state B always relaxes to the final state B, 
i.e. the interpretation of the image B is clearcut. In terms of energy this means that the 
state B lies in a potential well, whose minimum corresponds to the image B and evolution 
occurs at zero temperature. 

Now let us consider the recognition of an ambiguous image. We have in mind the 
modelling of the following situation: the word B is shown to the person and it is explained 
to him that this word may assume several meanings: in the memory of the person the images 
B + bi, B + b2, ■ ■ ■ are fixed where hi, 62, • • • are the elements of the given explanation; if 
now the word B is presented for recognition its interpretation will be ambiguous leading to 
one of the results B + bi. For modelling it suffices to assume that recognition begins at the 
temperature T exceeding the potential barriers between images B + bi, B + b2, ■ ■ ■ and then 
the temperature decreases and the system relaxes to one of the local minima corresponding 
to the images B + bi. Hereafter, we shall consider that decrease of temperature from 
the initial value Tq occurs adiabatically so that the system with overwhelming probability 
relaxes to the deepest of the local minima. 



Simultaneous recognition of several images 

Suppose that after presentation of the stimulus A the system relax to one of the images 
Ai, A2, ■ ■ ■ and after presentation of the stimulus B to one of the images Bi, B2, ■ ■ ■ What 
will happen, if the stimuli A and B are presented simultaneously? 

The process of recognition begins with switching out the magnetic field in the portion 
of the A -layer, where the image carrier is localized. Let presentation of the images A 
and B requires switching out the field in the regions Q^ and Qb of the associative layer. 
Let us assume for simplicity that the regions Qa and Qb do not overlap (the qualitative 
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Figure 4: Potential relief for recognition of the images A and B separately (continuous 
curve) and for simultaneous recognition (broken line). 



picture persists in the general case); then Vi = V/^ + Vj^ where V/^ and V^^ differ from zero 
respectively for the neurons i lying in the regions Qa and Qb- The energy (2) assumes the 
form 

E{V,^ + Vf} = - E T^,V,^Vf - Y: T^,V,^Vf - 2 ^ T^.V/'Vf (12) 

ij ij ij 

If the stimuli A and B are presented separately, then correspondingly V^^ = or V/^ = 0, 
so that only the first or second term remains in the right part of (12). The third term in 
(12) shows that with the simultaneous presentation of the stymuli A and B, they exert 
on each other an effect equivalent to the presence of a magnetic field. The result of such 
interaction is particularly clear if the configurations Ai, A2, . ■ . and respectively Bi, B2, . . . 
are almost degenerate. If the links Tij between the regions Qa and Qb are absent then 
the equilibrium states of the system have the form {As,Bs') and are almost degenerate. 
Inclusion of the weak links Tij between the regions Qa and Qb does not change significantly 
the equilibrium configurations, but changes the relative position on the energy scale. If on 
separate recognition of the images A and B certain configurations (e.g. Ai and B2 in Fig. 4) 
are energetically advantageous, then on their simultaneous recognition other configurations 
may prove more advantageous (e.g. A2 and Bi). It may be visually imagined that the 
interaction change the potential relief where A and B relax (Fig. 4). As a result, the 
interpretation of the polysemantic image proves to depend on the context. 

In the preceding paper [1], the existence of an algorithm was postulated for the recog- 




Figure 5: Recognition of a continuous sequence of images Ai, A2, ■ . .: in the associative 
layer there is sequential excitation of the regions Qi, Q2, ■ ■ ■, which looks fike "diffusion". 

nition of A^ simultaneously presented polysemantic images; now it is clear that such an 
algorithm can be naturally realized in the Hopfield model in the following conditions: (a) 
recognition begins at finite temperature Tq which then adiabaticatty diminishes; (b) the 
potential barriers between the meanings of the polysemantic image are less than Tq; (c) the 
potential bariers between the different polysemantic images are considerably greater than 
Tq] and (d) the interaction between the images is sufficiently weak. 



Recognition of a continuous sequence of images 

Above we assumed that all the images {Vf} have localized carriers. Due to short- 
range nature of links Tjj, the associatively-related images should have closely positioned 
or overlapping carriers, while the uncorrelated images have the carriers localized in the 
remote parts of the memory. Therefore, presentation of a continuous sequence of stimuli 
Ai, A2, ■ ■ ■ give rise to sequential excitation of the regions Qi, Q2, • • • in the associative layer, 
which looks like "diffusion" (Fig. 5): the nearest in succession images are correlated and 
their carriers form conglomerations, whereas to the remote images in sequence correspond 
carriers remote in space, as a result of weakening of the correlations. It gives the possibility 
to establish the correspondence of the time of appearance of the image with the position 
of its carrier in space, which greatly simplifies the work of the centre. After fixing the 
appearance of the excited neurons in a certain portion of the A -layer, the centre raises 
the temperature of this portion to the value Tq, and after the characteristic time tq begins 
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its adiabatic decreasing. This stimulates establishment of the steady state, the bringing of 
the corresponding images into the i?- layer and the zeroing of the corresponded portion 
of the memory (Fig. 2,d-g). Thereby, the trajectory of the "diffusion" movement (Fig. 5) 
is "wiped off' after a certain time so that at each moment only its finite segment exists 
in the memory; recognition of a continuous sequence of images is thereby reduced to the 
simultaneous recognition of a finite number of images (see above). 



Time delays and humorous effect 

Let at the moment of time t = the neurons are excited in a certain portion of memory 
(Fig. 2,b), at the moment tq the corresponding image is brought into consciousness (Fig. 2,e) 
and at the moment ti the portion of memory under consideration is zeroed (Fig. 2,f). The 
delay tq corresponds to the interval AC and the delay ri to the interval AB in Fig. 2 of the 
paper [1]; the latter is related to the fact that the possibility of reinterpreting the image 
persists until the corresponding portion of memory is nullified. Obviously, Ti > Tq: this 
result was derived in [1] from requirements of the optimal processing of the algorithm, but 
now it holds due to the constructive features of the model. 

The delay tq is determined by the rate of decreasing of temperature (see above), while 
the delay ri by the moment of activation (from the centre) of the R ^ A links. The optimal 
choice of the parameters Tq and Ti is determined by different principles: the parameter 
tq corresponds to the delay from the moment information is received by the brain till 
its appearance in consciousness and is upwardly limited by the value r^ax p[], while the 
parameter ri regulates the loading of memorylj, i.e. the fraction of excited neurons in the 
A -layer (Fig. 5). This fraction should not be too small for the operative possibitities of 
the brain to be used in full, and not too large for interference of the images arriving at 
different times not to appear. 

Let at the moment t = the image A enters the memory; evolution in the corresponding 
potential relief (Fig. 4, continuous curve) leads at the moment t = tq to stabilization of the 
memory neurons in the configuration Ai and excitation of the corresponding i?- neuron. 
Let in the interval between tq and ri new image B enters the memory and changes the 
potential relief for A (Fig. 4, broken line). If the temperature at this moment is sufficient 
to overcome the barrier, the system begins to relax to the configuration A2 (in fact, leaving 
of the state Ai is possible even at T = since the minimum corresponding to Ai may 
disappear). Such breaking of stationarity in a certain memory region after exciting of the 
corresponding i?- neuron is considered as a sign of the humorous effect [1]. The image Ai 
(or, in the general case, a version consisting of several images) is realized as "false" and 
should be immediately deleted from consciousness; however, it cannot be done in the course 
of the usual routine (Fig. 2,e,g) because of the need to obtain a new steady configuration 
A2. 



^ Unlike the general case [1] , in the present model there is no special operative memory. 
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Mechanism of laughter 

The "emergency" deletion of the false version from the i?- layer is achieved by activation 
of the links between the i?- layer and the motor cortex (Fig. 1); excitation of the R 
neurons is transmitted to the motoneurons and produce the contraction of certain muscles, 
i.e. laughter. 

In fact we return to the old idea by G. Spencer [6] that the humorous effect is accompa- 
nied by release from the mental process of nervous energy which is directed at the muscular 
reaction. This idea was supported by Darwin [7] and Freud [8] but was criticized by later 
investigators [9] in view of the difficulty to introduce the concept of "nervous energy". 
In fact, the definition of energy for neural networks may be given only under condition 
Tij = Tji [2], which is not very realistic and does not hold for the considered model as a 
whole; so the concept of "nervous energy" should not be taken seriously. Nevertheless, the 
qualitative picture agrees with Spencer's hypothesis: it looks as if the energy of excitation 
is expelled from the i?- neurons into the motor cortex. 

The release of nervous energy in the presence of the humorous effect was validated by 
Spencer using the concept of "descending incongruity" — transition from a high to low 
style, i.e. from the state with rich associations to the state with poor associations. Such an 
interpretation of the humorous effect is surely limited and cannot lay claim to universality. 
In the proposed scheme the "release of nervous energy" (in the conditional sense indicated 
above) is related with the need to delete the false version from consciousness, which requires 
"zeroing" of a certain portion of the i?- layer (i.e. transition of the excited neurons to the 
nonexcited state). 

Since laughter is interpreted as an unconditioned reflex to the humorous effect, the 
known cases of "ousting" of laughter by secondary emotions require an explanation. Laugh- 
ter may be ousted by the emotions of indignation (an indecent anecdote is told to a puritan), 
fear (a bush suddenly turns ouf to be a bear), compassion (a man in front of you slips on 
a water melon rind and badly hurts himself), shame (you slipped on a water melon rind) 
and so on. Within the Spencer hypothesis [6] all these instances are explained by the fact 
that the "released nervous energy" is directed not to the motoneurons but to other parts 
of the nervous system and goes on the formation of a secondary emotion (the i?- layer is 
connected not to the motor cortex but to the limbic system). The same ideas [8] are used 
to explain the known fact that a joke produces the greatest effect if it is told extremely 
laconically: laconicity reduces the probability of formation of secondary associations liable 
to absorb the "nervous energy". 

Casting the excitation of the neurons into different portions of the motor cortex, a man 
can regulate the level of the muscular reaction: this can explain its dependence on mood, 
the psychological setting, the presence of a laughing enviroment [10] and so on. 
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Conclusion 

The realization of a sense of humour requires a quite intricately organized system. We 
would emphasize, however, that this complex organization is entirely governed by the task 
of treating a continuous sequence of polysemantic images; the existence of the humorous 
effect is a secondary consequence. It is well-known [4, pp. 219-241], that different parts 
of the brain have their own specialization; the proposed model may lay claim to be a 
description of only the region of the brain where the linguistic functions are concentrated 
(so-called Broca and Wernicke zones); other portions of the brain may have a different 
organization. 

The author is grateful to D. S. Chernavskii for fruitful discussions. 
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